Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 32412 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.2 MiB |
| Average record size in memory | 200.0 B |
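The memory figures above can be reproduced with simple arithmetic. The sketch below is one plausible reading, not something the report states: it assumes every cell occupies 8 bytes and that the index adds a fixed 128 bytes. Under those assumptions it recovers the total size, the average record size, and the 253.3 KiB memory size listed for each variable.

```python
# Assumptions (not stated in the report): each cell takes 8 bytes and
# the DataFrame index contributes a fixed 128 bytes.
BYTES_PER_VALUE = 8
INDEX_BYTES = 128

n_rows, n_cols = 32412, 25  # observations and variables from the table

# Memory of one column (index included) and of the whole dataset.
column_bytes = n_rows * BYTES_PER_VALUE + INDEX_BYTES
total_bytes = n_rows * n_cols * BYTES_PER_VALUE + INDEX_BYTES

column_kib = column_bytes / 1024
total_mib = total_bytes / 1024 ** 2
avg_record_bytes = total_bytes / n_rows

print(f"per column: {column_kib:.1f} KiB")
print(f"total: {total_mib:.1f} MiB")
print(f"average record: {avg_record_bytes:.1f} B")
```

If the real column dtypes differ from the assumed 8-byte width, the per-column figures would change accordingly.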
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 14 |
arrival_date_year has constant value "2017" | Constant |
country has a high cardinality: 143 distinct values | High cardinality |
stays_in_weekend_nights is highly correlated with total_nights | High correlation |
stays_in_week_nights is highly correlated with total_nights | High correlation |
is_repeated_guest is highly correlated with previous_bookings_not_canceled | High correlation |
previous_bookings_not_canceled is highly correlated with is_repeated_guest | High correlation |
total_nights is highly correlated with stays_in_weekend_nights and 1 other fields | High correlation |
previous_cancellations is highly correlated with previous_bookings_not_canceled | High correlation |
previous_bookings_not_canceled is highly correlated with previous_cancellations | High correlation |
children is highly correlated with arrival_date_year | High correlation |
babies is highly correlated with arrival_date_year | High correlation |
arrival_date_year is highly correlated with children and 11 other fields | High correlation |
is_repeated_guest is highly correlated with arrival_date_year | High correlation |
adults is highly correlated with arrival_date_year | High correlation |
stays_in_weekend_nights is highly correlated with arrival_date_year | High correlation |
distribution_channel is highly correlated with arrival_date_year | High correlation |
required_car_parking_spaces is highly correlated with arrival_date_year | High correlation |
reserved_room_type is highly correlated with arrival_date_year | High correlation |
meal is highly correlated with arrival_date_year | High correlation |
arrival_date_month is highly correlated with arrival_date_year | High correlation |
is_canceled is highly correlated with arrival_date_year | High correlation |
customer_type is highly correlated with arrival_date_year | High correlation |
id is highly correlated with is_canceled and 3 other fields | High correlation |
is_canceled is highly correlated with id | High correlation |
arrival_date_month is highly correlated with id and 1 other fields | High correlation |
arrival_date_week_number is highly correlated with id and 1 other fields | High correlation |
children is highly correlated with reserved_room_type | High correlation |
distribution_channel is highly correlated with is_repeated_guest | High correlation |
is_repeated_guest is highly correlated with id and 1 other fields | High correlation |
reserved_room_type is highly correlated with children | High correlation |
previous_cancellations is highly skewed (γ1 = 23.76463483) | Skewed |
previous_bookings_not_canceled is highly skewed (γ1 = 23.46652365) | Skewed |
days_in_waiting_list is highly skewed (γ1 = 24.80627415) | Skewed |
id has unique values | Unique |
lead_time has 1376 (4.2%) zeros | Zeros |
stays_in_week_nights has 1937 (6.0%) zeros | Zeros |
previous_cancellations has 32186 (99.3%) zeros | Zeros |
previous_bookings_not_canceled has 31362 (96.8%) zeros | Zeros |
booking_changes has 27745 (85.6%) zeros | Zeros |
days_in_waiting_list has 32235 (99.5%) zeros | Zeros |
total_of_special_requests has 17338 (53.5%) zeros | Zeros |
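The "Skewed" alerts above report γ1 values above 23 for columns that are more than 96% zeros. The sketch below illustrates why a zero-inflated column produces a large positive skewness. It uses the population form g1 = m3 / m2^1.5 on a hypothetical sample. The report does not say which estimator it uses, so values from a bias-corrected estimator may differ slightly.

```python
def skewness(values):
    """Population skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Hypothetical zero-inflated sample: 99 zeros and a single non-zero value.
zero_inflated = [0] * 99 + [1]
g1 = skewness(zero_inflated)

# A symmetric sample for comparison; its skewness is zero.
symmetric = skewness([1, 2, 3, 4, 5])

print(f"zero-inflated: {g1:.3f}, symmetric: {symmetric:.3f}")
```

The rarer and larger the non-zero values, the larger γ1 becomes, which is consistent with the columns flagged above.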
Reproduction
| Analysis started | 2022-07-06 05:04:12.309897 |
|---|---|
| Analysis finished | 2022-07-06 05:04:41.250134 |
| Duration | 28.94 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
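The reported duration follows directly from the two timestamps in the table above. A minimal check, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction table.
started = datetime.strptime("2022-07-06 05:04:12.309897", FMT)
finished = datetime.strptime("2022-07-06 05:04:41.250134", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```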
id
Real number (ℝ≥0)
| Distinct | 32412 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60131.50518 |
| Minimum | 6086 |
|---|---|
| Maximum | 97903 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 6086 |
|---|---|
| 5-th percentile | 7726.55 |
| Q1 | 45291.75 |
| median | 53394.5 |
| Q3 | 89800.25 |
| 95-th percentile | 96282.45 |
| Maximum | 97903 |
| Range | 91817 |
| Interquartile range (IQR) | 44508.5 |
Descriptive statistics
| Standard deviation | 29953.58618 |
|---|---|
| Coefficient of variation (CV) | 0.4981346481 |
| Kurtosis | -1.310575271 |
| Mean | 60131.50518 |
| Median Absolute Deviation (MAD) | 32382.5 |
| Skewness | -0.2686934461 |
| Sum | 1948982346 |
| Variance | 897217324.9 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 6147 | 1 | < 0.1% |
| 23159 | 1 | < 0.1% |
| 23422 | 1 | < 0.1% |
| 87566 | 1 | < 0.1% |
| 84860 | 1 | < 0.1% |
| 95099 | 1 | < 0.1% |
| 97146 | 1 | < 0.1% |
| 91001 | 1 | < 0.1% |
| 93048 | 1 | < 0.1% |
| 7030 | 1 | < 0.1% |
| Other values (32402) | 32402 |
| Value | Count | Frequency (%) |
| 6086 | 1 | |
| 6087 | 1 | |
| 6088 | 1 | |
| 6089 | 1 | |
| 6090 | 1 | |
| 6091 | 1 | |
| 6092 | 1 | |
| 6093 | 1 | |
| 6094 | 1 | |
| 6095 | 1 |
| Value | Count | Frequency (%) |
| 97903 | 1 | |
| 97902 | 1 | |
| 97901 | 1 | |
| 97900 | 1 | |
| 97899 | 1 | |
| 97898 | 1 | |
| 97897 | 1 | |
| 97896 | 1 | |
| 97895 | 1 | |
| 97894 | 1 |
is_canceled
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 32412 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 19821 | 61.2% |
| 1 | 12591 | 38.8% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 19821 | |
| 1 | 12591 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 19821 | |
| 1 | 12591 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32412 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 19821 | |
| 1 | 12591 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 32412 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 19821 | |
| 1 | 12591 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 19821 | |
| 1 | 12591 |
lead_time
Real number (ℝ≥0)
| Distinct | 368 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 97.58786869 |
| Minimum | 0 |
|---|---|
| Maximum | 373 |
| Zeros | 1376 |
| Zeros (%) | 4.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 23 |
| median | 76 |
| Q3 | 155 |
| 95-th percentile | 270 |
| Maximum | 373 |
| Range | 373 |
| Interquartile range (IQR) | 132 |
Descriptive statistics
| Standard deviation | 86.50714564 |
|---|---|
| Coefficient of variation (CV) | 0.8864538882 |
| Kurtosis | 0.02388213705 |
| Mean | 97.58786869 |
| Median Absolute Deviation (MAD) | 61 |
| Skewness | 0.8690844332 |
| Sum | 3163018 |
| Variance | 7483.486247 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1376 | 4.2% |
| 1 | 783 | 2.4% |
| 2 | 492 | 1.5% |
| 3 | 403 | 1.2% |
| 4 | 391 | 1.2% |
| 7 | 377 | 1.2% |
| 6 | 366 | 1.1% |
| 5 | 345 | 1.1% |
| 28 | 313 | 1.0% |
| 8 | 290 | 0.9% |
| Other values (358) | 27276 |
| Value | Count | Frequency (%) |
| 0 | 1376 | |
| 1 | 783 | |
| 2 | 492 | 1.5% |
| 3 | 403 | 1.2% |
| 4 | 391 | 1.2% |
| 5 | 345 | 1.1% |
| 6 | 366 | 1.1% |
| 7 | 377 | 1.2% |
| 8 | 290 | 0.9% |
| 9 | 236 | 0.7% |
| Value | Count | Frequency (%) |
| 373 | 27 | |
| 372 | 11 | < 0.1% |
| 368 | 36 | |
| 367 | 2 | < 0.1% |
| 366 | 1 | < 0.1% |
| 365 | 2 | < 0.1% |
| 364 | 13 | < 0.1% |
| 361 | 2 | < 0.1% |
| 359 | 3 | < 0.1% |
| 358 | 3 | < 0.1% |
arrival_date_year
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| 2017 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 129648 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2017 |
|---|---|
| 2nd row | 2017 |
| 3rd row | 2017 |
| 4th row | 2017 |
| 5th row | 2017 |
Common Values
| Value | Count | Frequency (%) |
| 2017 | 32412 |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2017 | 32412 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 32412 | |
| 0 | 32412 | |
| 1 | 32412 | |
| 7 | 32412 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 129648 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 32412 | |
| 0 | 32412 | |
| 1 | 32412 | |
| 7 | 32412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 129648 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 32412 | |
| 0 | 32412 | |
| 1 | 32412 | |
| 7 | 32412 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 129648 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 32412 | |
| 0 | 32412 | |
| 1 | 32412 | |
| 7 | 32412 |
arrival_date_month
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| May | |
|---|---|
| April | |
| June | |
| March | |
| July | |
| Other values (3) |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 5.039954338 |
| Min length | 3 |
Characters and Unicode
| Total characters | 163355 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | January |
|---|---|
| 2nd row | January |
| 3rd row | January |
| 4th row | January |
| 5th row | January |
Common Values
| Value | Count | Frequency (%) |
| May | 5262 | 16.2% |
| April | 4878 | 15.0% |
| June | 4580 | 14.1% |
| March | 4277 | 13.2% |
| July | 3626 | 11.2% |
| February | 3543 | 10.9% |
| January | 3150 | 9.7% |
| August | 3096 | 9.6% |
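The length and character statistics for this variable can be derived from the month counts in the Common Values table above. The sketch below recomputes the total number of characters, the mean length, and the per-character counts. It is an illustration of how such figures arise, not the profiler's own code.

```python
from collections import Counter

# Month counts copied from the Common Values table.
month_counts = {
    "May": 5262, "April": 4878, "June": 4580, "March": 4277,
    "July": 3626, "February": 3543, "January": 3150, "August": 3096,
}

n_rows = sum(month_counts.values())
total_chars = sum(len(month) * count for month, count in month_counts.items())
mean_length = total_chars / n_rows

# Each character of a month name occurs once per row holding that month.
char_counts = Counter()
for month, count in month_counts.items():
    for ch in month:
        char_counts[ch] += count

print(n_rows, total_chars, round(mean_length, 9))
print(char_counts.most_common(4))
```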
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| may | 5262 | |
| april | 4878 | |
| june | 4580 | |
| march | 4277 | |
| july | 3626 | |
| february | 3543 | |
| january | 3150 | |
| august | 3096 |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 21091 | |
| r | 19391 | |
| a | 19382 | |
| y | 15581 | |
| J | 11356 | 7.0% |
| M | 9539 | 5.8% |
| l | 8504 | 5.2% |
| e | 8123 | 5.0% |
| A | 7974 | 4.9% |
| n | 7730 | 4.7% |
| Other values (9) | 34684 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 130943 | |
| Uppercase Letter | 32412 | 19.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 21091 | |
| r | 19391 | |
| a | 19382 | |
| y | 15581 | |
| l | 8504 | |
| e | 8123 | 6.2% |
| n | 7730 | 5.9% |
| i | 4878 | 3.7% |
| p | 4878 | 3.7% |
| c | 4277 | 3.3% |
| Other values (5) | 17108 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 11356 | |
| M | 9539 | |
| A | 7974 | |
| F | 3543 | 10.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 163355 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 21091 | |
| r | 19391 | |
| a | 19382 | |
| y | 15581 | |
| J | 11356 | 7.0% |
| M | 9539 | 5.8% |
| l | 8504 | 5.2% |
| e | 8123 | 5.0% |
| A | 7974 | 4.9% |
| n | 7730 | 4.7% |
| Other values (9) | 34684 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 163355 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 21091 | |
| r | 19391 | |
| a | 19382 | |
| y | 15581 | |
| J | 11356 | 7.0% |
| M | 9539 | 5.8% |
| l | 8504 | 5.2% |
| e | 8123 | 5.0% |
| A | 7974 | 4.9% |
| n | 7730 | 4.7% |
| Other values (9) | 34684 |
arrival_date_week_number
Real number (ℝ≥0)
| Distinct | 35 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.80405405 |
| Minimum | 1 |
|---|---|
| Maximum | 35 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 10 |
| median | 18 |
| Q3 | 25 |
| 95-th percentile | 33 |
| Maximum | 35 |
| Range | 34 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 9.17738444 |
|---|---|
| Coefficient of variation (CV) | 0.5154659951 |
| Kurtosis | -0.9762359697 |
| Mean | 17.80405405 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -0.001657087894 |
| Sum | 577065 |
| Variance | 84.22438516 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 1297 | 4.0% |
| 18 | 1271 | 3.9% |
| 20 | 1261 | 3.9% |
| 15 | 1218 | 3.8% |
| 21 | 1170 | 3.6% |
| 22 | 1161 | 3.6% |
| 14 | 1101 | 3.4% |
| 23 | 1098 | 3.4% |
| 19 | 1080 | 3.3% |
| 8 | 1054 | 3.3% |
| Other values (25) | 20701 |
| Value | Count | Frequency (%) |
| 1 | 703 | |
| 2 | 720 | |
| 3 | 731 | |
| 4 | 780 | |
| 5 | 683 | |
| 6 | 658 | |
| 7 | 989 | |
| 8 | 1054 | |
| 9 | 953 | |
| 10 | 962 |
| Value | Count | Frequency (%) |
| 35 | 511 | |
| 34 | 625 | |
| 33 | 791 | |
| 32 | 643 | |
| 31 | 733 | |
| 30 | 783 | |
| 29 | 711 | |
| 28 | 952 | |
| 27 | 859 | |
| 26 | 1021 |
arrival_date_day_of_month
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.65694804 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 15.5 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.766429465 |
|---|---|
| Coefficient of variation (CV) | 0.5599066587 |
| Kurtosis | -1.187567447 |
| Mean | 15.65694804 |
| Median Absolute Deviation (MAD) | 7.5 |
| Skewness | 0.003528410818 |
| Sum | 507473 |
| Variance | 76.85028556 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15 | 1317 | 4.1% |
| 9 | 1227 | 3.8% |
| 2 | 1200 | 3.7% |
| 28 | 1139 | 3.5% |
| 25 | 1127 | 3.5% |
| 3 | 1121 | 3.5% |
| 27 | 1119 | 3.5% |
| 24 | 1116 | 3.4% |
| 14 | 1115 | 3.4% |
| 19 | 1111 | 3.4% |
| Other values (21) | 20820 |
| Value | Count | Frequency (%) |
| 1 | 1057 | |
| 2 | 1200 | |
| 3 | 1121 | |
| 4 | 923 | |
| 5 | 1099 | |
| 6 | 1076 | |
| 7 | 878 | |
| 8 | 1066 | |
| 9 | 1227 | |
| 10 | 1038 |
| Value | Count | Frequency (%) |
| 31 | 564 | |
| 30 | 860 | |
| 29 | 916 | |
| 28 | 1139 | |
| 27 | 1119 | |
| 26 | 1110 | |
| 25 | 1127 | |
| 24 | 1116 | |
| 23 | 1059 | |
| 22 | 917 |
stays_in_weekend_nights
Categorical
HIGH CORRELATION
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| 0 | |
|---|---|
| 2 | |
| 1 | |
| 3 | 101 |
| 4 | 70 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 32412 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 13915 | |
| 2 | 9221 | |
| 1 | 9105 | |
| 3 | 101 | 0.3% |
| 4 | 70 | 0.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 13915 | |
| 2 | 9221 | |
| 1 | 9105 | |
| 3 | 101 | 0.3% |
| 4 | 70 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13915 | |
| 2 | 9221 | |
| 1 | 9105 | |
| 3 | 101 | 0.3% |
| 4 | 70 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32412 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 13915 | |
| 2 | 9221 | |
| 1 | 9105 | |
| 3 | 101 | 0.3% |
| 4 | 70 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 32412 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 13915 | |
| 2 | 9221 | |
| 1 | 9105 | |
| 3 | 101 | 0.3% |
| 4 | 70 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 13915 | |
| 2 | 9221 | |
| 1 | 9105 | |
| 3 | 101 | 0.3% |
| 4 | 70 | 0.2% |
stays_in_week_nights
Real number (ℝ≥0)
HIGH CORRELATION
ZEROS
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.34009009 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 1937 |
| Zeros (%) | 6.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.375169854 |
|---|---|
| Coefficient of variation (CV) | 0.5876568 |
| Kurtosis | -0.421009231 |
| Mean | 2.34009009 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.451786677 |
| Sum | 75847 |
| Variance | 1.891092127 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 9004 | |
| 1 | 8038 | |
| 3 | 7326 | |
| 4 | 2975 | 9.2% |
| 5 | 2869 | 8.9% |
| 0 | 1937 | 6.0% |
| 6 | 263 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 1937 | 6.0% |
| 1 | 8038 | |
| 2 | 9004 | |
| 3 | 7326 | |
| 4 | 2975 | 9.2% |
| 5 | 2869 | 8.9% |
| 6 | 263 | 0.8% |
| Value | Count | Frequency (%) |
| 6 | 263 | 0.8% |
| 5 | 2869 | 8.9% |
| 4 | 2975 | 9.2% |
| 3 | 7326 | |
| 2 | 9004 | |
| 1 | 8038 | |
| 0 | 1937 | 6.0% |
adults
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| 2.0 | |
|---|---|
| 1.0 | |
| 3.0 | 1817 |
| 0.0 | 69 |
| 4.0 | 9 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 97236 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 2.0 |
| 4th row | 1.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 24237 | |
| 1.0 | 6280 | 19.4% |
| 3.0 | 1817 | 5.6% |
| 0.0 | 69 | 0.2% |
| 4.0 | 9 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2.0 | 24237 | |
| 1.0 | 6280 | 19.4% |
| 3.0 | 1817 | 5.6% |
| 0.0 | 69 | 0.2% |
| 4.0 | 9 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 32481 | |
| . | 32412 | |
| 2 | 24237 | |
| 1 | 6280 | 6.5% |
| 3 | 1817 | 1.9% |
| 4 | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64824 | |
| Other Punctuation | 32412 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 32481 | |
| 2 | 24237 | |
| 1 | 6280 | 9.7% |
| 3 | 1817 | 2.8% |
| 4 | 9 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 32412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 97236 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 32481 | |
| . | 32412 | |
| 2 | 24237 | |
| 1 | 6280 | 6.5% |
| 3 | 1817 | 1.9% |
| 4 | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 97236 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 32481 | |
| . | 32412 | |
| 2 | 24237 | |
| 1 | 6280 | 6.5% |
| 3 | 1817 | 1.9% |
| 4 | 9 | < 0.1% |
children
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| 0.0 | |
|---|---|
| 1.0 | 1394 |
| 2.0 | 653 |
| 3.0 | 5 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 97236 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 30360 | |
| 1.0 | 1394 | 4.3% |
| 2.0 | 653 | 2.0% |
| 3.0 | 5 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 30360 | |
| 1.0 | 1394 | 4.3% |
| 2.0 | 653 | 2.0% |
| 3.0 | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 62772 | |
| . | 32412 | |
| 1 | 1394 | 1.4% |
| 2 | 653 | 0.7% |
| 3 | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64824 | |
| Other Punctuation | 32412 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 62772 | |
| 1 | 1394 | 2.2% |
| 2 | 653 | 1.0% |
| 3 | 5 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 32412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 97236 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 62772 | |
| . | 32412 | |
| 1 | 1394 | 1.4% |
| 2 | 653 | 0.7% |
| 3 | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 97236 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 62772 | |
| . | 32412 | |
| 1 | 1394 | 1.4% |
| 2 | 653 | 0.7% |
| 3 | 5 | < 0.1% |
babies
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| 0.0 | |
|---|---|
| 1.0 | 171 |
| 2.0 | 4 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 97236 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 32237 | |
| 1.0 | 171 | 0.5% |
| 2.0 | 4 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 32237 | |
| 1.0 | 171 | 0.5% |
| 2.0 | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 64649 | |
| . | 32412 | |
| 1 | 171 | 0.2% |
| 2 | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64824 | |
| Other Punctuation | 32412 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 64649 | |
| 1 | 171 | 0.3% |
| 2 | 4 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 32412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 97236 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 64649 | |
| . | 32412 | |
| 1 | 171 | 0.2% |
| 2 | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 97236 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 64649 | |
| . | 32412 | |
| 1 | 171 | 0.2% |
| 2 | 4 | < 0.1% |
meal
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| BB | |
|---|---|
| SC | |
| HB | 2399 |
| SC | 258 |
| FB | 36 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.944279896 |
| Min length | 2 |
Characters and Unicode
| Total characters | 289902 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BB |
|---|---|
| 2nd row | BB |
| 3rd row | BB |
| 4th row | BB |
| 5th row | BB |
Common Values
| Value | Count | Frequency (%) |
| BB | 24684 | |
| SC | 5035 | 15.5% |
| HB | 2399 | 7.4% |
| SC | 258 | 0.8% |
| FB | 36 | 0.1% |
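The table above lists SC twice, and the length statistics (maximum 9, minimum 2) together with the 225078 space characters suggest that most labels are padded with whitespace. The sketch below is a hedged reconstruction. It assumes the padded labels carry 7 trailing spaces and that only the 258-row SC group is unpadded. The report does not show where the spaces sit, but this assignment reproduces the reported character totals, and stripping the labels merges the two SC groups.

```python
from collections import Counter

# Assumption: padded labels have 7 trailing spaces (9 characters total).
# The report confirms that spaces exist, not their position.
PAD = " " * 7
raw_counts = {
    "BB" + PAD: 24684,
    "SC" + PAD: 5035,
    "HB" + PAD: 2399,
    "SC": 258,          # assumed to be the only unpadded label
    "FB" + PAD: 36,
}

total_chars = sum(len(label) * count for label, count in raw_counts.items())
space_chars = sum(label.count(" ") * count for label, count in raw_counts.items())

# Strip surrounding whitespace so both SC variants fall into one category.
cleaned = Counter()
for label, count in raw_counts.items():
    cleaned[label.strip()] += count

print(total_chars, space_chars)
print(dict(cleaned))
```

After stripping, the variable has four categories rather than five, which matches the lowercase word counts further down in this section.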
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| bb | 24684 | |
| sc | 5293 | 16.3% |
| hb | 2399 | 7.4% |
| fb | 36 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 225078 | ||
| B | 51803 | 17.9% |
| S | 5293 | 1.8% |
| C | 5293 | 1.8% |
| H | 2399 | 0.8% |
| F | 36 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 225078 | |
| Uppercase Letter | 64824 | 22.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 51803 | |
| S | 5293 | 8.2% |
| C | 5293 | 8.2% |
| H | 2399 | 3.7% |
| F | 36 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 225078 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 225078 | |
| Latin | 64824 | 22.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 51803 | |
| S | 5293 | 8.2% |
| C | 5293 | 8.2% |
| H | 2399 | 3.7% |
| F | 36 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 225078 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 289902 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 225078 | ||
| B | 51803 | 17.9% |
| S | 5293 | 1.8% |
| C | 5293 | 1.8% |
| H | 2399 | 0.8% |
| F | 36 | < 0.1% |
country
Categorical
| Distinct | 143 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| PRT | |
|---|---|
| GBR | |
| FRA | |
| DEU | |
| ESP | |
| Other values (138) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.985530051 |
| Min length | 2 |
Characters and Unicode
| Total characters | 96767 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 32 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | PRT |
|---|---|
| 2nd row | AUT |
| 3rd row | AUT |
| 4th row | PRT |
| 5th row | BEL |
Common Values
| Value | Count | Frequency (%) |
| PRT | 9887 | |
| GBR | 3927 | 12.1% |
| FRA | 3477 | 10.7% |
| DEU | 2378 | 7.3% |
| ESP | 1932 | 6.0% |
| ITA | 1153 | 3.6% |
| IRL | 1060 | 3.3% |
| BEL | 882 | 2.7% |
| BRA | 881 | 2.7% |
| USA | 774 | 2.4% |
| Other values (133) | 6061 |
Length
| Value | Count | Frequency (%) |
| prt | 9887 | |
| gbr | 3927 | 12.1% |
| fra | 3477 | 10.7% |
| deu | 2378 | 7.3% |
| esp | 1932 | 6.0% |
| ita | 1153 | 3.6% |
| irl | 1060 | 3.3% |
| bel | 882 | 2.7% |
| bra | 881 | 2.7% |
| usa | 774 | 2.4% |
| Other values (133) | 6061 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 20621 | |
| P | 12223 | |
| T | 11651 | |
| A | 7271 | 7.5% |
| E | 6275 | 6.5% |
| B | 5776 | 6.0% |
| U | 4465 | 4.6% |
| G | 4185 | 4.3% |
| S | 3795 | 3.9% |
| F | 3660 | 3.8% |
| Other values (16) | 16845 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 96767 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 20621 | |
| P | 12223 | |
| T | 11651 | |
| A | 7271 | 7.5% |
| E | 6275 | 6.5% |
| B | 5776 | 6.0% |
| U | 4465 | 4.6% |
| G | 4185 | 4.3% |
| S | 3795 | 3.9% |
| F | 3660 | 3.8% |
| Other values (16) | 16845 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 96767 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 20621 | |
| P | 12223 | |
| T | 11651 | |
| A | 7271 | 7.5% |
| E | 6275 | 6.5% |
| B | 5776 | 6.0% |
| U | 4465 | 4.6% |
| G | 4185 | 4.3% |
| S | 3795 | 3.9% |
| F | 3660 | 3.8% |
| Other values (16) | 16845 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 96767 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 20621 | |
| P | 12223 | |
| T | 11651 | |
| A | 7271 | 7.5% |
| E | 6275 | 6.5% |
| B | 5776 | 6.0% |
| U | 4465 | 4.6% |
| G | 4185 | 4.3% |
| S | 3795 | 3.9% |
| F | 3660 | 3.8% |
| Other values (16) | 16845 |
distribution_channel
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |
| TA/TO | |
|---|---|
| Direct | |
| Corporate | 1602 |
| GDS | 85 |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.304825373 |
| Min length | 3 |
Characters and Unicode
| Total characters | 171940 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TA/TO |
|---|---|
| 2nd row | TA/TO |
| 3rd row | TA/TO |
| 4th row | TA/TO |
| 5th row | TA/TO |
Common Values
| Value | Count | Frequency (%) |
| TA/TO | 27083 | |
| Direct | 3642 | 11.2% |
| Corporate | 1602 | 4.9% |
| GDS | 85 | 0.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| ta/to | 27083 | |
| direct | 3642 | 11.2% |
| corporate | 1602 | 4.9% |
| gds | 85 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 54166 | |
| A | 27083 | |
| / | 27083 | |
| O | 27083 | |
| r | 6846 | 4.0% |
| e | 5244 | 3.0% |
| t | 5244 | 3.0% |
| D | 3727 | 2.2% |
| i | 3642 | 2.1% |
| c | 3642 | 2.1% |
| Other values (6) | 8180 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 113831 | |
| Lowercase Letter | 31026 | 18.0% |
| Other Punctuation | 27083 | 15.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 6846 | |
| e | 5244 | |
| t | 5244 | |
| i | 3642 | |
| c | 3642 | |
| o | 3204 | |
| p | 1602 | 5.2% |
| a | 1602 | 5.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 54166 | |
| A | 27083 | |
| O | 27083 | |
| D | 3727 | 3.3% |
| C | 1602 | 1.4% |
| G | 85 | 0.1% |
| S | 85 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 27083 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 144857 | |
| Common | 27083 | 15.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 54166 | |
| A | 27083 | |
| O | 27083 | |
| r | 6846 | 4.7% |
| e | 5244 | 3.6% |
| t | 5244 | 3.6% |
| D | 3727 | 2.6% |
| i | 3642 | 2.5% |
| c | 3642 | 2.5% |
| o | 3204 | 2.2% |
| Other values (5) | 4976 | 3.4% |
Common
| Value | Count | Frequency (%) |
| / | 27083 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 171940 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 54166 | |
| A | 27083 | |
| / | 27083 | |
| O | 27083 | |
| r | 6846 | 4.0% |
| e | 5244 | 3.0% |
| t | 5244 | 3.0% |
| D | 3727 | 2.2% |
| i | 3642 | 2.1% |
| c | 3642 | 2.1% |
| Other values (6) | 8180 | 4.8% |
is_repeated_guest
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |

| Value | Count |
|---|---|
| 0 | 31395 |
| 1 | 1017 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 32412 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 31395 | 96.9% |
| 1 | 1017 | 3.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 31395 | |
| 1 | 1017 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 31395 | |
| 1 | 1017 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32412 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 31395 | |
| 1 | 1017 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 32412 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 31395 | |
| 1 | 1017 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 31395 | |
| 1 | 1017 | 3.1% |
previous_cancellations
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.01160064174 |
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 32186 |
| Zeros (%) | 99.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1804726207 |
|---|---|
| Coefficient of variation (CV) | 15.55712389 |
| Kurtosis | 681.4587898 |
| Mean | 0.01160064174 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.76463483 |
| Sum | 376 |
| Variance | 0.03257036681 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 32186 | 99.3% |
| 1 | 165 | 0.5% |
| 2 | 29 | 0.1% |
| 6 | 15 | < 0.1% |
| 4 | 10 | < 0.1% |
| 3 | 6 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 32186 | |
| 1 | 165 | 0.5% |
| 2 | 29 | 0.1% |
| 3 | 6 | < 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 15 | < 0.1% |
| 5 | 1 | < 0.1% |
| 4 | 10 | < 0.1% |
| 3 | 6 | < 0.1% |
| 2 | 29 | 0.1% |
| 1 | 165 | 0.5% |
| 0 | 32186 |
previous_bookings_not_canceled
Real number (ℝ≥0)
HIGH CORRELATION (×4), SKEWED, ZEROS
| Distinct | 46 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1714179933 |
| Minimum | 0 |
| Maximum | 72 |
| Zeros | 31362 |
| Zeros (%) | 96.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 72 |
| Range | 72 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.875170138 |
|---|---|
| Coefficient of variation (CV) | 10.93916748 |
| Kurtosis | 722.4618089 |
| Mean | 0.1714179933 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.46652365 |
| Sum | 5556 |
| Variance | 3.516263047 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 31362 | 96.8% |
| 1 | 424 | 1.3% |
| 2 | 161 | 0.5% |
| 3 | 87 | 0.3% |
| 4 | 59 | 0.2% |
| 5 | 46 | 0.1% |
| 6 | 38 | 0.1% |
| 7 | 34 | 0.1% |
| 8 | 24 | 0.1% |
| 10 | 22 | 0.1% |
| Other values (36) | 155 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 31362 | |
| 1 | 424 | 1.3% |
| 2 | 161 | 0.5% |
| 3 | 87 | 0.3% |
| 4 | 59 | 0.2% |
| 5 | 46 | 0.1% |
| 6 | 38 | 0.1% |
| 7 | 34 | 0.1% |
| 8 | 24 | 0.1% |
| 9 | 19 | 0.1% |
| Value | Count | Frequency (%) |
| 72 | 1 | |
| 71 | 1 | |
| 70 | 1 | |
| 69 | 1 | |
| 68 | 1 | |
| 67 | 1 | |
| 66 | 1 | |
| 65 | 1 | |
| 64 | 1 | |
| 63 | 1 |
reserved_room_type
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |

| Value | Count |
|---|---|
| A | 23471 |
| D | 6123 |
| E | 1644 |
| F | 503 |
| G | 278 |
| Other values (2) | 393 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 16 |
| Min length | 16 |
Characters and Unicode
| Total characters | 518592 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 23471 | 72.4% |
| D | 6123 | 18.9% |
| E | 1644 | 5.1% |
| F | 503 | 1.6% |
| G | 278 | 0.9% |
| C | 201 | 0.6% |
| B | 192 | 0.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| a | 23471 | |
| d | 6123 | 18.9% |
| e | 1644 | 5.1% |
| f | 503 | 1.6% |
| g | 278 | 0.9% |
| c | 201 | 0.6% |
| b | 192 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 486180 | 93.8% |
| A | 23471 | 4.5% |
| D | 6123 | 1.2% |
| E | 1644 | 0.3% |
| F | 503 | 0.1% |
| G | 278 | 0.1% |
| C | 201 | < 0.1% |
| B | 192 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 486180 | |
| Uppercase Letter | 32412 | 6.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 23471 | |
| D | 6123 | 18.9% |
| E | 1644 | 5.1% |
| F | 503 | 1.6% |
| G | 278 | 0.9% |
| C | 201 | 0.6% |
| B | 192 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 486180 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 486180 | |
| Latin | 32412 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 23471 | |
| D | 6123 | 18.9% |
| E | 1644 | 5.1% |
| F | 503 | 1.6% |
| G | 278 | 0.9% |
| C | 201 | 0.6% |
| B | 192 | 0.6% |
Common
| Value | Count | Frequency (%) |
| 486180 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 518592 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 486180 | 93.8% |
| A | 23471 | 4.5% |
| D | 6123 | 1.2% |
| E | 1644 | 0.3% |
| F | 503 | 0.1% |
| G | 278 | 0.1% |
| C | 201 | < 0.1% |
| B | 192 | < 0.1% |
booking_changes
Real number (ℝ≥0)
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2167407133 |
| Minimum | 0 |
| Maximum | 18 |
| Zeros | 27745 |
| Zeros (%) | 85.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6405505718 |
|---|---|
| Coefficient of variation (CV) | 2.955377243 |
| Kurtosis | 64.62703588 |
| Mean | 0.2167407133 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.374420631 |
| Sum | 7025 |
| Variance | 0.410305035 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 27745 | 85.6% |
| 1 | 3065 | 9.5% |
| 2 | 1160 | 3.6% |
| 3 | 268 | 0.8% |
| 4 | 117 | 0.4% |
| 5 | 29 | 0.1% |
| 6 | 16 | < 0.1% |
| 7 | 5 | < 0.1% |
| 16 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 27745 | |
| 1 | 3065 | 9.5% |
| 2 | 1160 | 3.6% |
| 3 | 268 | 0.8% |
| 4 | 117 | 0.4% |
| 5 | 29 | 0.1% |
| 6 | 16 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 18 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 5 | < 0.1% |
| 6 | 16 | |
| 5 | 29 |
days_in_waiting_list
Real number (ℝ≥0)
| Distinct | 75 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2623411082 |
| Minimum | 0 |
| Maximum | 223 |
| Zeros | 32235 |
| Zeros (%) | 99.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 223 |
| Range | 223 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.733026518 |
|---|---|
| Coefficient of variation (CV) | 18.04149777 |
| Kurtosis | 748.8413878 |
| Mean | 0.2623411082 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.80627415 |
| Sum | 8503 |
| Variance | 22.40154002 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 32235 | 99.5% |
| 59 | 6 | < 0.1% |
| 71 | 6 | < 0.1% |
| 60 | 6 | < 0.1% |
| 25 | 6 | < 0.1% |
| 4 | 5 | < 0.1% |
| 14 | 5 | < 0.1% |
| 5 | 5 | < 0.1% |
| 46 | 5 | < 0.1% |
| 28 | 5 | < 0.1% |
| Other values (65) | 128 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 32235 | |
| 1 | 3 | < 0.1% |
| 2 | 2 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 223 | 1 | < 0.1% |
| 185 | 2 | |
| 183 | 1 | < 0.1% |
| 175 | 1 | < 0.1% |
| 165 | 1 | < 0.1% |
| 154 | 2 | |
| 122 | 1 | < 0.1% |
| 121 | 1 | < 0.1% |
| 117 | 1 | < 0.1% |
| 113 | 4 |
customer_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |

| Value | Count |
|---|---|
| Transient | 27461 |
| Transient-Party | 4427 |
| Contract | 359 |
| Group | 165 |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 9.788072319 |
| Min length | 5 |
Characters and Unicode
| Total characters | 317251 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Transient |
|---|---|
| 2nd row | Transient |
| 3rd row | Transient |
| 4th row | Transient |
| 5th row | Transient |
Common Values
| Value | Count | Frequency (%) |
| Transient | 27461 | 84.7% |
| Transient-Party | 4427 | 13.7% |
| Contract | 359 | 1.1% |
| Group | 165 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| transient | 27461 | |
| transient-party | 4427 | 13.7% |
| contract | 359 | 1.1% |
| group | 165 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 64135 | |
| t | 37033 | |
| r | 36839 | |
| a | 36674 | |
| T | 31888 | |
| s | 31888 | |
| i | 31888 | |
| e | 31888 | |
| y | 4427 | 1.4% |
| - | 4427 | 1.4% |
| Other values (7) | 6164 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 275985 | |
| Uppercase Letter | 36839 | 11.6% |
| Dash Punctuation | 4427 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 64135 | |
| t | 37033 | |
| r | 36839 | |
| a | 36674 | |
| s | 31888 | |
| i | 31888 | |
| e | 31888 | |
| y | 4427 | 1.6% |
| o | 524 | 0.2% |
| c | 359 | 0.1% |
| Other values (2) | 330 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 31888 | |
| P | 4427 | 12.0% |
| C | 359 | 1.0% |
| G | 165 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4427 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 312824 | |
| Common | 4427 | 1.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 64135 | |
| t | 37033 | |
| r | 36839 | |
| a | 36674 | |
| T | 31888 | |
| s | 31888 | |
| i | 31888 | |
| e | 31888 | |
| y | 4427 | 1.4% |
| P | 4427 | 1.4% |
| Other values (6) | 1737 | 0.6% |
Common
| Value | Count | Frequency (%) |
| - | 4427 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 317251 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 64135 | |
| t | 37033 | |
| r | 36839 | |
| a | 36674 | |
| T | 31888 | |
| s | 31888 | |
| i | 31888 | |
| e | 31888 | |
| y | 4427 | 1.4% |
| - | 4427 | 1.4% |
| Other values (7) | 6164 | 1.9% |
required_car_parking_spaces
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 253.3 KiB |

| Value | Count |
|---|---|
| 0 | 30935 |
| 1 | 1468 |
| 2 | 6 |
| 8 | 2 |
| 3 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 32412 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 30935 | 95.4% |
| 1 | 1468 | 4.5% |
| 2 | 6 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 30935 | |
| 1 | 1468 | 4.5% |
| 2 | 6 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 30935 | |
| 1 | 1468 | 4.5% |
| 2 | 6 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32412 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 30935 | |
| 1 | 1468 | 4.5% |
| 2 | 6 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 32412 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 30935 | |
| 1 | 1468 | 4.5% |
| 2 | 6 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 30935 | |
| 1 | 1468 | 4.5% |
| 2 | 6 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
total_of_special_requests
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6577810687 |
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 17338 |
| Zeros (%) | 53.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8343410618 |
|---|---|
| Coefficient of variation (CV) | 1.268417565 |
| Kurtosis | 1.170586881 |
| Mean | 0.6577810687 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.214905428 |
| Sum | 21320 |
| Variance | 0.6961250074 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 17338 | 53.5% |
| 1 | 10037 | 31.0% |
| 2 | 3988 | 12.3% |
| 3 | 907 | 2.8% |
| 4 | 124 | 0.4% |
| 5 | 18 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 17338 | |
| 1 | 10037 | |
| 2 | 3988 | 12.3% |
| 3 | 907 | 2.8% |
| 4 | 124 | 0.4% |
| 5 | 18 | 0.1% |
| Value | Count | Frequency (%) |
| 5 | 18 | 0.1% |
| 4 | 124 | 0.4% |
| 3 | 907 | 2.8% |
| 2 | 3988 | 12.3% |
| 1 | 10037 | |
| 0 | 17338 |
total_nights
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.207978526 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 253.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 7 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.73868266 |
|---|---|
| Coefficient of variation (CV) | 0.5419870009 |
| Kurtosis | 0.3797450506 |
| Mean | 3.207978526 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.8445081871 |
| Sum | 103977 |
| Variance | 3.023017394 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 8395 | 25.9% |
| 2 | 6760 | 20.9% |
| 4 | 5828 | 18.0% |
| 1 | 5465 | 16.9% |
| 5 | 2409 | 7.4% |
| 7 | 2284 | 7.0% |
| 6 | 939 | 2.9% |
| 8 | 215 | 0.7% |
| 9 | 62 | 0.2% |
| 10 | 55 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 5465 | |
| 2 | 6760 | |
| 3 | 8395 | |
| 4 | 5828 | |
| 5 | 2409 | 7.4% |
| 6 | 939 | 2.9% |
| 7 | 2284 | 7.0% |
| 8 | 215 | 0.7% |
| 9 | 62 | 0.2% |
| 10 | 55 | 0.2% |
| Value | Count | Frequency (%) |
| 10 | 55 | 0.2% |
| 9 | 62 | 0.2% |
| 8 | 215 | 0.7% |
| 7 | 2284 | 7.0% |
| 6 | 939 | 2.9% |
| 5 | 2409 | 7.4% |
| 4 | 5828 | |
| 3 | 8395 | |
| 2 | 6760 | |
| 1 | 5465 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
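As a minimal sketch of the calculation described above (not the report generator's own implementation), the following pure-Python code ranks both variables, gives tied values their average rank, and then divides the covariance of the ranks by the product of their standard deviations. The helper names `average_ranks` and `spearman_rho` and the sample data are illustrative assumptions.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Covariance of the ranks divided by the product of their standard deviations."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    # The 1/n factors in the covariance and the variances cancel out.
    return cov / math.sqrt(var_x * var_y)


# Made-up data: y = x**3 is monotonic but not linear.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
```

Because only the ordering of the values matters, `rho` comes out as 1 (up to floating-point rounding) for this monotonic but nonlinear relationship. A constant input has zero rank variance, so this sketch would raise `ZeroDivisionError` for it.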
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear relationship the steepness of the line (its angle to the x-axis) does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
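The definition above can be sketched in a few lines of pure Python. This is an illustration under stated assumptions, not the code used to produce the report: the function name `pearson_r` and the sample values are made up, and the second call only demonstrates the invariance under a shift and a positive rescaling.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n factors in the covariance and the variances cancel out.
    return cov / math.sqrt(var_x * var_y)


# Made-up, roughly linear data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
r = pearson_r(x, y)

# Shifting y and rescaling it by a positive factor leaves r unchanged.
r_rescaled = pearson_r(x, [3.0 * v + 7.0 for v in y])
```

Here `r` and `r_rescaled` agree up to floating-point rounding. Rescaling by a negative factor would flip the sign of r while keeping its magnitude.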
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
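The pair-counting rule above corresponds to the variant usually called tau-a. A minimal sketch follows, assuming the hypothetical function name `kendall_tau_a` and made-up data. Tied pairs count as neither concordant nor discordant here, whereas some libraries (for example SciPy's `kendalltau`) default to the tau-b variant, which adjusts the denominator for ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1  # both variables move in the same direction
        elif direction < 0:
            discordant += 1  # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it adds to neither count
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Made-up data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
```

For this input five of the six pairs are concordant and one is discordant, so `tau` is (5 - 1) / 6.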
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | id | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | booking_changes | days_in_waiting_list | customer_type | required_car_parking_spaces | total_of_special_requests | total_nights |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 6086 | 1 | 74.0 | 2017 | January | 1 | 1 | 1 | 0 | 2.0 | 0.0 | 0.0 | BB | PRT | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 0 | 1 |
| 1 | 6087 | 1 | 62.0 | 2017 | January | 1 | 1 | 2 | 2 | 2.0 | 0.0 | 0.0 | BB | AUT | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 1 | 4 |
| 2 | 6088 | 1 | 62.0 | 2017 | January | 1 | 1 | 2 | 2 | 2.0 | 0.0 | 0.0 | BB | AUT | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 1 | 4 |
| 3 | 6089 | 1 | 71.0 | 2017 | January | 1 | 1 | 2 | 2 | 1.0 | 0.0 | 0.0 | BB | PRT | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 1 | 4 |
| 4 | 6090 | 1 | 172.0 | 2017 | January | 1 | 1 | 2 | 5 | 2.0 | 0.0 | 0.0 | BB | BEL | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 0 | 7 |
| 5 | 6091 | 1 | 52.0 | 2017 | January | 1 | 1 | 2 | 5 | 1.0 | 0.0 | 0.0 | BB | DEU | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 0 | 7 |
| 6 | 6092 | 1 | 143.0 | 2017 | January | 1 | 2 | 1 | 1 | 2.0 | 0.0 | 0.0 | BB | BRA | Direct | 0 | 0 | 0 | A | 1 | 0 | Transient | 0 | 1 | 2 |
| 7 | 6093 | 1 | 21.0 | 2017 | January | 1 | 2 | 1 | 3 | 2.0 | 0.0 | 0.0 | BB | BRA | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 1 | 4 |
| 8 | 6094 | 1 | 89.0 | 2017 | January | 1 | 2 | 1 | 3 | 2.0 | 0.0 | 0.0 | BB | GBR | TA/TO | 0 | 0 | 0 | E | 0 | 0 | Transient | 0 | 0 | 4 |
| 9 | 6095 | 1 | 48.0 | 2017 | January | 1 | 2 | 1 | 4 | 2.0 | 0.0 | 0.0 | BB | PRT | Direct | 0 | 0 | 0 | A | 1 | 0 | Transient | 0 | 2 | 5 |
Last rows
| | id | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | booking_changes | days_in_waiting_list | customer_type | required_car_parking_spaces | total_of_special_requests | total_nights |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 32402 | 97894 | 0 | 185.0 | 2017 | August | 35 | 30 | 1 | 4 | 2.0 | 0.0 | 0.0 | SC | CHE | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 1 | 5 |
| 32403 | 97895 | 0 | 247.0 | 2017 | August | 35 | 31 | 1 | 3 | 2.0 | 0.0 | 0.0 | BB | GBR | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 0 | 4 |
| 32404 | 97896 | 0 | 109.0 | 2017 | August | 35 | 31 | 1 | 3 | 2.0 | 0.0 | 0.0 | BB | GBR | TA/TO | 0 | 0 | 0 | D | 0 | 0 | Transient | 0 | 1 | 4 |
| 32405 | 97897 | 0 | 44.0 | 2017 | August | 35 | 31 | 1 | 3 | 2.0 | 0.0 | 0.0 | SC | DEU | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 1 | 4 |
| 32406 | 97898 | 0 | 188.0 | 2017 | August | 35 | 31 | 2 | 3 | 2.0 | 0.0 | 0.0 | BB | DEU | Direct | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 0 | 5 |
| 32407 | 97899 | 0 | 164.0 | 2017 | August | 35 | 31 | 2 | 4 | 2.0 | 0.0 | 0.0 | BB | DEU | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 0 | 6 |
| 32408 | 97900 | 0 | 21.0 | 2017 | August | 35 | 30 | 2 | 5 | 2.0 | 0.0 | 0.0 | BB | BEL | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 2 | 7 |
| 32409 | 97901 | 0 | 23.0 | 2017 | August | 35 | 30 | 2 | 5 | 2.0 | 0.0 | 0.0 | BB | BEL | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 0 | 7 |
| 32410 | 97902 | 0 | 34.0 | 2017 | August | 35 | 31 | 2 | 5 | 2.0 | 0.0 | 0.0 | BB | DEU | TA/TO | 0 | 0 | 0 | D | 0 | 0 | Transient | 0 | 4 | 7 |
| 32411 | 97903 | 0 | 109.0 | 2017 | August | 35 | 31 | 2 | 5 | 2.0 | 0.0 | 0.0 | BB | GBR | TA/TO | 0 | 0 | 0 | A | 0 | 0 | Transient | 0 | 0 | 7 |